Combining Per-frame and Per-track Cues for Multi-person Action Recognition

Authors

Sameh Khamis

Vlad I. Morariu

Larry S. Davis

Abstract

We propose a model to combine per-frame and per-track cues for action recognition. With multiple targets in a scene, our model simultaneously captures the natural harmony of an individual’s action in a scene and the flow of actions of an individual in a video sequence, inferring valid tracks in the process. Our motivation is based on the unlikely discordance of an action in a structured scene, both at the track level and the frame level (e.g., a person dancing in a crowd of joggers). While we can utilize sampling approaches for inference in our model, we instead devise a global inference algorithm by decomposing the problem and solving the subproblems exactly and efficiently, recovering a globally optimal joint solution in several cases. Finally, we improve on the state-of-the-art action recognition results for two publicly available datasets.

Related resources

Leveraging Structure in Activity Recognition: Context and Spatiotemporal Dynamics

Title of dissertation: LEVERAGING STRUCTURE IN ACTIVITY RECOGNITION: CONTEXT AND SPATIOTEMPORAL DYNAMICS Sameh Khamis, Doctor of Philosophy, 2015 Dissertation directed by: Larry S. Davis Department of Computer Science Activity recognition is one of the fundamental problems of computer vision. An activity recognition system aims to identify the actions of humans from an image or a video. This pr...

Ragdolls in Action – Action Recognition by 3d Pose Recovery from Monocular Video

We present a novel approach to reconstruct and track articulated objects, specifically humans, in 3D from monocular videos for action recognition, by combining techniques from both image processing and 3D computer animation. The goal is to establish a system that is able to recognize basic actions (like walk, run) from frame to frame in a scene with more than one person. In a first step a featu...

Action Change Detection in Video Based on HOG

Background and Objectives: Action recognition, the process of labeling an unknown action in a query video, is a challenging problem due to event complexity, variations in imaging conditions, and intra- and inter-individual action variability. A number of solutions have been proposed to solve the action recognition problem. Many of these frameworks suppose that each video sequence includes only one ...

Continuous Action Recognition by Action-specific Motion Models

This paper proposes human motion prior models with multiple actions for action recognition in videos. A training sequence of each action, such as walking and jogging, is separately recorded by a motion capture system and modeled independently. Unlike existing approaches with similar motion prior models, our method uses the multiple models simultaneously for particle filtering in order to...

Ear Recognition from One Sample Per Person

Biometrics has the advantages of efficiency and convenience in identity authentication. As one of the most promising biometric-based methods, ear recognition has received broad attention and research. Previous studies have achieved remarkable performance with multiple samples per person (MSPP) in the gallery. However, most conventional methods are insufficient when there is only one sample per ...

Publication date: 2012
